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ABSTRACT 

Assuming  a  zero  order  process,  the  beta-binomial  distribution  model 
is  often  used  to  describe  consumers'  brand  switching  behavior.  A  variety 
of  methods  have  been  used  to  estimate  the  parameters  of  the  beta-binomial 
distribution,  each  of  which  assumes  fixed  number  of  purchase  occasions 
across  households.  Consumer  panel  data,  however,  are  typically  available 
for  a  fixed  duration  of  time  (say,  a  year)  during  which  consumers  in- 
variably make  different  number  of  purchases  in  the  product  class.  The 
purpose  of  this  paper  is  to  present  a  method  for  estimating  zero  order 
models  from  panel  data  containing  different  number  of  purchase  records 
per  household.  Empirical  data  are  presented  to  demonstrate  the  efficiency 
of  this  estimation  method. 
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INTRODUCTION 

Marketing  researchers  frequently  use  two  different  approaches  to 
present  analytical  results  from  consumer  panel  data:  (i)  the  negative 
binomial  distribution  (NBD)  model  (see  [2]),  and  (ii)  the  beta-binomial 
distribution  (BBD)  model  (see  [5,8]).  The  first  method  involves  the 
construction  of  a  frequency  distribution  of  purchases  in  the  sample  who 
buy  0,1,2,...,  or  in  general,  r  units  of  an  item  (  a  particular  pack- 
size  of  a  brand)  in  some  specified  time  period.  This  frequency  distribution 
information  is  then  used  to  obtain  two  key  measures  of  market  response  to 
this  item,  namely,  penetration  and  purchase  frequency.  Penetration  is  de- 
fined as  the  proportion  of  people  who  buy  this  item  at  least  once  in  the 
given  time  period.  Purchase  frequency  is  the  average  number  of  times  these 
buyers  purchase  the  item  in  the  specified  time  period  (see  [2]  for  further 
details).  Assuming  a  zero  order  process,  the  BBD  model  is  used  to  report 
the  proportion  of  households  in  the  sample  who  buy  a  brand  a  given  number  of 
times  out  of  a  fixed  number  of  purchase  occasions.  Additionally,  the  BBD 
model  is  employed  to  describe  consumers'  brand  switching  behavior.  Since 
each  person  is  defined  by  a  single  parameter,  p.  these  zero  order  models  are 
completely  defined  by  specifying  the  parameters  of  the  mixing  beta  distribution. 
The  conditional  and  unconditional  switching  and  repeat  purchase  probabilities 
representing  measures  of  market  response  to  the  brand  can  be  easily  obtained 
from  a  knowledge  of  these  parameters  (see  [4,5,8]).  A  variety  of  methods 
have  been  used  to  estimate  the  parameters  of  the  BBD  model,  each  of  which 


assumes  fixed  number  of  purchase  occasions  across  consumers  (see  [1,3,4,7]), 
Consumer  panel  data,  however,  are  commonly  available  for  fixed  durations  of 
time  (say,  a  year)  during  which  households  invariably  make  a  different  number 
of  purchases  in  the  product  class.  The  purpose  of  this  paper  is  to  present 
a  method  for  estimating  zero  order  models  from  panel  data  containing 
differing  number  of  purchase  records  per  household.  Empirical  data  are  pre- 
sented to  demonstrate  the  efficiency  of  this  estimation  method. 

THE  METHOD 
In  the  BBD  model,  each  consumer  is  assumed  to  buy  a  given  brand  (say. 
Brand  1)  with  probability,  p,  or  some  brand  in  the  aggregate  "all  other" 
class  with  probability,  1-p.  Further,  each  consumer  is  assumed  to  make  a 
fixed  number  of  purchases  in  the  product  class.  The  model  allows  for 
heterogeneity  in  the  population  by  letting  the  p  values  differ  across  in- 
dividuals according  to  a  beta  distribution.  The  marginal  distribution  of 
the  number  of  successes  (i.e.,  purchases  of  Brand  1),  x,  on  k  Bernoulli 
trials  (i.e.,  purchases  of  the  product  class)  is  then  beta-binomial  with 

P(x|a,B,k)  =  (!;)  ^-^^^^fj^  ,  0<p<Ua,B>0,    (1) 

where  a  and  B  are  the  parameters  of  the  beta  distribution  and  B(.)  denotes 
the  beta  function.  The  mean  and  variance  of  the  beta-binomial  distribution 
are  given  by 

E[x]  =  k^  , 

VaK-fvl  -  kag(k+a+6)  /px 

^^•^LX]  -  (a+B7(a+B+l)  •  ^2^ 
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As  indicated  earlier,  a  variety  of  methods  have  been  used  to  estimate  the 
parameters  of  the  beta-binomial  distribution,  each  of  which  assumes  a  fixed 
number  of  trials  across  consumers  (see  [1,3,4,7]).  The  maximum  likelihood 
estimation  procedure  presented  in  Kalwani  and  Morrison  [3,4]  can  be  extended 
to  the  case  of  variable  number  of  trials  per  household.  Denote  the  number 
of  purchase  records  of  household  j  by  k.,  where  j=l,2,...N.  Then,  assuming 
independence  between  k.  and  household  j's  probability  to  purchase  Brand  1, 
the  probability  of  household  j  making  x.  purchases  of  Brand  1  is  given  by 

(^   B(a+Xj,6+kj-Xj) 

P(x.|a,e,k.)  =  {  ^) •  ^^^ 

^  ^  Xj      B(a,3) 


The  likelihood  function  for  the  N  households  can  be  written  as  a  product  of 
N  probabilities  using  equation  (3).  Ignoring  the  constant  terms,  this 
likelihood  function  for  the  overall  sample  data  is  given  by 


N  B(a+x.,3+k.-x.) 

i!<(x.'s|a,6,k.'s)  =  TT ^ ^-^- 

^  ^  j=l    B(a,3) 


Taking  logarithms  and  expressing  the  beta  functions  in  terms  of  the  gamma 
functions,  the  above  expression  simplifies  to  the  following  log-likelihood 
function 


N 


L(x.'s|a,3,k.'s)  =  I 


rXj-1  k-Xj-1  k-1 

I  log(a+r)  +  I  log  (3+r)   -  Jlog(a+3+r) 
Lr=0  r=0  r=0 


(4) 
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In  many  zero  order  applications,  the  knowledge  of  a  brand's  market  share 
y=a/{a+g)  and  loyalty  index,  (f)=l/(l+a+3)  is  adequate  (see  [4]).  The 
conditional  as  well  as  unconditional  repeat  and  switching  probabilities 
can  be  obtained  from  a  knowledge  of  these  two  parameters.  It  is  shown 
in  [4]  that  the  transformed  parameters,  y  and  4),  are  computationally 
easier  to  estimate  than  the  original  parameters  since  their  maximum  like- 
lihood estimates  have  smaller  standard  errors  than  the  maximum  likelihood 
estimates  of  a  and  6.  Substituting  for  a  and  3  in  terms  of  y  and  (}), 
the  log  likelihood  function  in  equation  (4)  can  be  rewritten  as: 


N 
L(x.'sly,(j),k.'s)  =  I 


rxj-1  k.-x.-l 

I   log(y(l-(}))+r(}))+l  log((l-y)(l-(j))+r({)) 
Lr=0  r=0 


■  I   log(l-(})+r<l)) 
r=0 


(5) 


The  maximum  likelihood  estimates  of  y  and  cf)  (or  a  and  3)  can  be  obtained  by 
maximizing  the  log-likelihood  function  in  equation  (5)  (or  equation  (4)) 
through  numerical  optimization. 

Another  way  of  representing  the  log-likelihood  function  in  equation 
(5)  (or  equation  (4))  involves  classifying  the  households  in  the  panel 
according  to  number  of  purchases  of  the  product  class,  k,  where  k=l,2,...K. 
The  log-likelihood  function  in  equation  (5)  now  becomes 


K 


L(nNy,ct),k's)  =  I 
^  k=l 


;x-i 


k-x-l 


I     u^]l   log(y(l-(t))+r4.)  +  I   log((l-y)(l-(l))+r<t.) 


x=0  ^W=0 


k-1 


r=0 


-  I   log(l-4)+r(l)) 
r=0        ' 


(6) 
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1^ 
where  n  is  the  number  of  consumers  who  make  x  purchases  of  Brand  1  given 

that  they  make  a  total  of  k  purchases  in  the  product  class.  The  numerical 
optimization  procedure  based  on  maximizing  the  log-likelihood  function  in 
equation  (6)  is  computationally  much  faster  than  that  based  on  equation 
(5)  since  the  inside  expression  containing  the  three  summations  is  es- 
timated many  fewer  times.  Empirical  data  are  presented  below  to  illustrate 
the  application  of  the  above  estimation  method. 

ILLUSTRATIVE  APPLICATION 
The  data  presented  below  are  from  an  MRCA  (Market  Research  Corporation 
of  America)  consumer  panel.  They  represent  margarine  purchases  of  2496  house- 
holds during  the  fourth  quarter  of  1976.  The  frequency  distribution  denoting 
the  number  of  households  with  margarine  purchases  ranging  from  1  to  20  is 
displayed  in  Table  1.  Examination  of  Table  1  reveals  that  methods  based  on 

INSERT  TABLE  1  HERE 

fixed  number  of  purchases  per  household  would  be  inefficient  in  that  they 
would  ignore  considerable  purchase  information.  Consider  for  instance,  the 
commonly  used  sequence  length  of  5  purchase  records.  Estimation  methods  based 
on  fixed  number  of  purchase  occasions  would  utilize  only  five  purchase  records 
of  each  of  1161  households  (or  46.5%  of  2496  households)  who  made  more  than 
five  purchases  of  margarine.  This  would  result  in  the  exclusion  of  two 
types  of  purchase  information:  (i)  all  purchase  records  of  1335  (=2496-1161) 
households  with  four  or  fewer  purchases  of  margarine,  and  (ii)  a  number  of 
purchase  records  of  those  households  with  more  than  five  purchases  of  margarine. 
The  maximum  likelihood  estimation  method  presented  in  this  paper  on  the  other 
hand,  would  utilize  all  purchase  records  of  all  the  2496  households  in  the 
panel.  Does  this  full  utilization  of  purchase  information  translate  into 
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increased  efficiency  for  the  estimation  method  based  on  variable  number  of 
purchase  records  per  houshold?  If  yes,  how  much  is  the  gain  in  efficiency? 

Recall  that  under  rather  general  regularity  conditions  (see  [6, 
pp.  295-302]),  maximum  likelihood  estimates  are  best  asymptotically  normal 
(BAN).  That  is,  they  are  consistent,  asyptotically  normal,  and  asymptotically 
efficient.  It  is  shown  in  [6]  that  BAN  estimators  obey  the  "inverse  square 
root  of  n  relationship";  that  if,  for  large  sample  sizes,  standard  error  of 
a  BAN  estimator  is  reduced  by  l//n  with  an  n-fold  increase  in  sample  size. 
Assuming  that  the  maximum  likelihood  functions  presented  in  this  paper 
satisfy  the  regularity  conditions  discussed  in  [6],  the  standard  errors  of 
maximum  likelihood  estimates  using  the_estimation  method  based  on  variable 
number  of  purchase  records  would  be^  —  times  the  standard  errors  of 
maximum  likelihood  estimates  based  on  fixed  number  of  purchase  records  per 
household  where  n^  and  n,,  denote  the  corresponding  sample  sizes. 

What  are  the  values  of  np  and  n^  for  the  illustrative  application 
presented  in  the  section?  The  answer  to  this  question  is  not  clear  cut. 
One  set  of  values  can  be  obtained  by  aggregating  households  with  their 
purchase  records  to  obtain  total  number  of  purchase  records  being  utilized 
for  parameter  estimation.  Thus,  the  estimation  method  based  on  five  purchase 
occasions  per  household  utilizes  5805  (=  1161  X  5)  purchase  records  while 
the  estimation  method  which  allows  for  variable  number  of  margarine  purchase 
records.  This  approach  to  computation  of  sample  sizes  would  imply  a  31% 

i.e.,  1  -J] 2236  ji^eduction  in  the  standard  error  of  estimate.  The  inadeq- 
uacy of  this  approach  becomes  evident  when  one  considers  the  extreme  case 
of  a  sample  of  one  household  with  a  very  large  number  of  purchase  records. 
Such  an  extreme  case  provides  no  information  on  heterogeneity  in  brand 
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purchase  probabilities. 

The  second  set  of  values  can  be  obtained  by  weighting  each  household 
equally  irrespective  of  the  number  of  its  margarine  purchases.  Recall  that 
the  parameters  that  are  being  estimated  -  a,  g  or  y,  (f>  -  refer  to  the  mixing 
distribution  of  purchase  probabilities  (assumed  to  be  beta  distribution 
in  this  paper).  Hence,  margarine  purchases  for  a  given  household  represent 
replications  and  the  aggregation  of  households  forms  the  sample  size.  This 


approach  to  the  computation  of  sample  sizes  would  imply  a  32%  (i.e.,  1  -  j-^^g^ 
reduction  in  the  standard  error  of  estimate. 

It  should  be  noted  that  the  lack  of  a  clear  cut  approach  to  the 
computation  of  sample  size  does  not  question  the  very  claim  of  improved 
efficiency  with  the  use  of  the  new  estimation  method.  It  merely  suggests 
that  the  exact  gain  in  efficiency  will  depend  on  the  interpretation  of  sample 
size.  Note,  that  simulated  purchase  data  can  be  used  to  obtain  accurate 
estimates  of  reductions  in  the  standard  errors  of  estimates. 

The  estimation  procedure  presented  above  was  employed  to  obtain 
maximum  likelihood  estimates  of  a,  B  and  p,  <t>   for  Parkay  margarine.  The 
results  obtained  therefrom  are  given  below: 

a  =  0.28677,  B  =  2.09998, 

M  =  0.12015  i   =  0.29456. 
Since  maximum  likelihood  estimates  are  invariant,  we  can  check  the  accuracy 
of  the  numerical  optimization  procedure  by  comparing  a/(a+3)  with  \x   and 
l/(l+a+6)  with  4).  The  results  of  this  comparison  are  given  below: 

^  =  0.12015,   -4-r  =  0.29527. 
^"■^  l+a+3 

and  they  reveal  accuracy  to  the  third  decimal  place. 


CONCLUSION 


The  methods  used  to  estimate  zero  order  models  assume  fixed  number 
of  purchase  occasions  across  households.  Panel  data,  however,  are 
normally  available  for  a  fixed  duration  of  time  and  contain  different 
number  of  purchase  records  per  household.  In  this  paper,  we  presented 
a  maximum  likelihood  method  for  estimating  the  parameters  of  zero  order 
models  given  variable  number  of  purchase  occasions  per  household. 
Empirical  data  from  a  MRCA  panel  were  presented  to  illustrate  the 
application  of  this  estimation  method. 


FOOTNOTE 


In  zero  order  models  each  consumer  is  assumed  to  pur- 
chase a  given  brand  (say.  Brand  1)  with  probability, 
p,  and  Brand  0  representing  the  aggregate  "all  other" 
class  with  the  complementary  probability,  1-p.  These 
models  allow  for  heterogeneity  in  the  population  by 
letting  the  p  values  differ  across  individuals. 


-10- 


Table  1 


NUMBER  OF  HOUSEHOLDS  WITH  DIFFERENT  NUMBER 
OF  PURCHASES  OF  MARGARINE 


NUMBER  OF 

NUMBER  OF 

NUMBER  OF 

NUMBER  OF 

PURCHASES  OF 

HOUSEHOLDS 

PURCHASES  OF 

HOUSEHOLDS 

MARGARINE 

MARGARINE 

1 

329 

11 

57 

2 

371 

12 

50 

3 

344 

13 

30 

4 

291 

14 

16 

5 

271 

15 

11 

6 

215 

16 

8 

7 

177 

17 

4 

8 

138 

18 

2 

9 

105 

19 

2 

10 

74 

20 

1 

Total  Number  of  Households  =  2496. 

Total  Number  of  Purchase  Records  =  12,236. 
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